Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces

نویسندگان

  • Eric Christiansen
  • Michael Gorbach
چکیده

One of the drawbacks of standard reinforcement learning techniques is that they only operate when both the state and action spaces are finite. Q-learning is one such algorithm. We propose an extension of Q-learning to infinite state and action sets called CHAMPAGNE, using a simple “Local Expert” function approximation method. We then experimentally test the performance of the algorithm on several navigation tasks. The algorithm is able to successfully solve a T-maze in pyrobot after reasonable training. We present results from varying the input type, reinforcement delay, and maximum memory size for this algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Agents for Artificial Life Domain

Adaptation and real time behavior becomes a necessity for agents that want to survive in Artificial Life environments. Agents are autonomous and therefore some form of unsupervised learning has to be used to improve and adapt their behavior in time. The neurodynamic reinforcement learning approach – Q-learning where Q-factors are represented using neural networks, overcomes problems of the clas...

متن کامل

طراحی پایدارساز PSS3B بر اساس الگوریتم KH و Q-learning برای میراسازی نوسانات فرکانس پایین سیستم قدرت تک‌ماشینه

The main purpose of this paper is to develop a supplementary signal using reinforcement learning (RL) to improve the performance of power system stabilizer (PSS). RL is one of the most important issues in the field of artificial intelligence and is the popular method for solving Markov decision procedure (MDP). In this paper, a control method is developed based on Q-learning and used to improve...

متن کامل

Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism

This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...

متن کامل

KLearn: Stochastic Optimization Applied to Simulated Robot Actions Final Report

Machine learning techniques and algorithms are prevalent in robotics, and have been used for computer vision, grasping, and legged walking. Reinforcement learning approaches have been developed over the past 15 years, with modern techniques using continuous action spaces for various robotic applications. Policy gradient learning allows various optimization techniques to quickly optimize robotic...

متن کامل

A Novel Robust Adaptive Trajectory Tracking in Robot Manipulators

In this paper, a novel adaptive sliding mode control for rigid robot manipulators is proposed. In the proposed system, since there may exist explicit unknown parameters and perturbations, a Lyapunov based approach is presented to increase system robustness, even in presence of arbitrarily large (but not infinite) discontinuous perturbations. To control and track the robot, a continuous controll...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008